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ABSTRACT 


Threat vehicle identification (TVI) is a key task in reducing fratricide on the 
battlefield. Military skills such as TVI are susceptible to memory decay. This 
research investigates the factors that are the best predictors of performance 
decrement in a TVI task. Thirty active-duty officers were randomly assigned to 
one of five groups of six. Each group was trained on vehicle identification using 
the U.S. Army’s Recognition of Combat Vehicles (ROC-V). All participants 
trained on 10 thermal and 10 visible vehicle images and reached a training 
proficiency of at least 90 percent on training post-tests. Each group was 
assigned a day when they would return to retake the post-tests. The groups 
returned 1, 2, 4, 8, and 16 days later. Participants also completed a recognition 
memory test to assess their individual memory levels. The results of this 
research indicate that memory does not decay exponentially for the TVI task. 
However, participants performed worse on the thermal image set than on the 
visible image set. Performance on the recognition memory test and time to 
complete training were significant predictors of performance on the TVI task. 
Results of this study could help shape TVI training plans and reduce the risk of 
fratricide. 
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EXECUTIVE SUMMARY 


Recent technological advances have provided today’s warfighters with robust 
capabilities on the battlefield. With the proliferation of fighting occurring during 
limited visibility, U.S. warfighters have become dependent upon forward-looking 
infrared (FLIR) technology to allow them to identify and engage targets. While 
FLIR has been a combat multiplier, incidents of fratricide remain costly for the 
U.S. During Operation Desert Storm, approximately 80% of the M1 Abrams Main 
Battle Tanks and M2 Bradley Fighting Vehicles lost were destroyed by friendly 
fire (Harmeyer & Antal, 1992). Furthermore, 90% of combat engagements 
occurred using infrared and thermal technology (O’Connor & O’Kane, 2000). 
The need to train thermal imagery is a necessity to reduce fratricide given the 
preponderance of engagements occurring in limited visibility (Dyer, Westergren, 
Shorter, & Brown, 1997). This study investigates memory decay and what 
factors are the best predictors of performance decrement in a threat vehicle 
identification (TVI) task. 

For this study, a laboratory experiment was conducted to assess 
recognition memory ability, memory decay, and performance on a TVI task using 
thermal and visible imagery on the U.S. Army’s Recognition of Combat Vehicles 
(ROC-V) program. The research used a 2x5 mixed design to analyze 
performance between thermal and visible images over time. Thirty active-duty 
Army, Marine Corps, Navy, and Air Force officers took a recognition memory test 
(RMT). After 72-hours had elapsed, participants returned to begin self-guided 
ROC-V training and to retake the RMT to establish baseline memory abilities. 
Participants trained on two vehicle sets. The first set consisted of 10 thermal 
vehicle images, and the second set consisted of 10 different visible vehicle 
images. Participants had to achieve a score of 90% on training quizzes and final 
training post-tests for each vehicle set to be considered trained and capable of 
moving forward in the study. Participants were randomly placed in one of five 
groups that consisted of six people. Each group returned on a second test day to 
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retake the thermal and visible post-tests. Group one returned after one day. 
Group 2 returned two days after training. Group 3 returned four days after training. 
Group 4 returned after eight days, and group 5 returned after 16 days. Each group 
also took an additional RMT on their second test day. Participants also completed 
a sleep log for the duration of their involvement in the experiment. 

Analysis from the study showed that the effect of time passing on 
performance for the thermal and visible post-tests was not significant. However, 
the time that it took participants to complete the training module was significant. 
The longer it took a participant to complete the training, the worse their 
performance was on the thermal post-test. Participants’ performance on the 
RMT that occurred 72-hours after the initial RMT was a significant predictor of 
performance on the TVI task. There was a significant difference between the 
thermal and visible image sets across all participants. Scores for the thermal test 
were significantly lower than scores for the visible test. Scores for the visible test 
were high and did not have significant differences when analyzed against all 
other factors. Participants who had previous TVI training performed significantly 
better than those who did not. 

The results of this research can help design training plans and increase 
operational performance. Results indicate that performance on a TVI task is 
higher for individuals with previous vehicle identification training. Therefore, 
ensuring that all warfighters are introduced to TVI in their careers can help 
mitigate the potential for fratricide. Significantly lower performance on thermal 
images still highlights the need to train more on this specific task, especially with 
the majority of combat engagements occurring during limited visibility. 
Warfighters’ individual memory abilities are predictors of their performance in a 
TVI task. Commanders should be able to identify personnel that take more time 
to complete TVI training modules and that have lower recognition memory 
abilities. This will allow leaders to ensure they have identified individuals who 
need additional training and allocate resources as necessary to increase their 
unit’s performance. 
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I. INTRODUCTION 


A. PROBLEM STATEMENT 

Military tasks differ in how quickly they are forgotten (Hagman & Rose, 
1983). The relevant literature identifies certain factors contribute to decay rate 
such as time and individual memory ability. The purpose of this study is to 
examine the factors that predict decay rate skills previously learned by 
warfighters. Minimizing decay rate for warfighters can increase their 
performance while decreasing training costs and resource expenditure. 

Threat vehicle identification (TVI) is a task that is evaluated through a 
gunnery skills test (GST) in accordance with Army Field Manual 3-20.21, Heavy 
Brigade Combat Team (HBCT) Gunnery. Crew members must complete the GST 
to be considered qualified to operate their stations during live-fire gunnery 
qualification. Training TVI is necessary to reduce fratricide. The TVI test is 
administered with a two-dimensional presentation interface using either analog or 
digital delivery methods. Recently, the Army developed a computer-based 
training application. Recognition of Combat Vehicles (ROC-V), to aid in soldiers’ 
learning. ROC-V (2013) “helps soldiers learn to identify the thermal signatures of 
combat vehicles through the use of an interactive curriculum that teaches the 
unique ‘hotspots,’ and overall shapes, characteristics, and capabilities for 276 
U.S., Allied, and foreign vehicles.” These skills are important given that 90% of 
the engagements during Operation Desert Storm occurred using thermal or night 
vision sights (O’Connor & O’Kane, 2000). 

Fratricide is the misuse of friendly force against friendly personnel that 
results in death or injury (Department of the Army, 1992). Soldiers encounter 
stressful combat conditions that necessitate quick decision-making when faced 
with identifying a friend or foe. These high-pressure situations can greatly 
increase the probability for fratricide. Americans destroyed approximately 80% 
(27 out of 35) of the Ml Abrams tanks and M2 Bradley Fighting Vehicles 
(Harmeyer & Antal, 1992) lost during combat operations in Operation Desert 
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storm. The U.S. Army experienced 55 instances of fratricide during the Global 
War on Terror (GWOT) from 11 September 2001 to 30 March 2008 that resulted 
in 30 fatalities (Webb & Hewett, 2010). 

Blue Force Tracker (BFT) and identification, friend and foe (IFF) signals 
are technological assets that assist warfighters in identifying targets. Doctrine, 
rules of engagement (ROE), tactics, combat identification panels (CIP), and 
situation awareness also inform the identification process. Even with these tools, 
the identification and final decision of whether a vehicle is friendly or adversarial 
is still made by a human (Briggs & Goldberg, 1995). Visual recognition is a key 
factor that contributes to target identification. Visual recognition is often the last 
sensory input process that happens before a warfighter makes the final decision 
to engage a target (Webb & Hewett, 2010). Therefore, it is important that training 
identification of threats is emphasized to complement technology in order to 
reduce fratricide. 

Soldiers reach a high level of skill proficiency upon completion of initial 
entry and advanced individual training. Performance decrement occurs during 
the period where no practice occurs between transfer from school to an 
operational unit (Knerr et al., 1984). The greater the lag between transfer of 
school to unit, the greater the memory decay. Soldiers must then be retrained 
once they get to their operational units in order to restore their high level of 
proficiency (Hagman & Rose, 1983). Hagman et al. also noted that opportunities 
for refresher training are limited due to resource constraints. A recent All 
Services Combat Identification Evaluation Team (ASCIET) report found that 
levels of proficiency were raised to greater than 90% accuracy using ROC-V 
(O’Connor, 2002). However, no studies have examined the rates of forgetting 
after using ROC-V. 

Clark and Paivio (1991) proposed a dual-coding theory in which memory 
is enhanced by both semantic and visual encoding, since either can lead to 
recall. They propose that imagery facilitates memory because it provides a 
second kind of memory code, and two codes are better than one. This supports 
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the ASCIET report on ROC-V since visual encoding and semantic descriptions of 
vehicles help an operator’s encoding of thermal vehicle signatures. McCauley, 
Eskes, and Moscovitch (1996) also found that the use of mental imagery could 
enhance memory in many situations. 

Memory decay has been studied in great detail (Stahl et al., 2010). 
Ebbinghaus (1913) first hypothesized that forgetting was an exponential function. 
The concept of a forgetting curve was derived from his work. The forgetting 
curve plots retention and forgetting as a function of time (see Figure 1). 
Ebbinghaus asserted that in order to decrease the effects of time on memory 
retention as plotted on the curve, repetitive overlearning of information would 
cause the forgetting curve to become shallower. Ebbinghaus’ research suggests 
that each rehearsal makes the forgetting curve less steep and restores 
performance to asymptotic stability. Ebbinghaus, however, conducted the 
experiment such that he was both the participant and the experimenter and the 
forgetting curve was based solely on his own individual results. Finkenbinder 
(1913) challenged Ebbinghaus’ work when he reported that the forgetting curve 
was slower than suggested by Ebbinghaus. 
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Figure 1. The Ebbinghaus Forgetting Curve (from Stahl et al., 2010) 
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Researchers have proposed multiple explanations for why humans forget 
and different strategies to improve retention. Brown and Craik (2000) found that 
when new information is learned, subsequent forgetting might occur as a result of 
ineffective or inappropriate encoding. Altmann and Gray (2002) suggested that 
memory decays over time unless it is actively maintained. 

Bailey (1989) conducted an industrial-based experiment in which he 
determined forgetting was a multiplicative function of the amount of learning 
accrued and the log of the interruption period. He also found that since there is 
no constant component of forgetting, an intercept term near zero was 
theoretically correct and valid for the purposes of modeling forgetting rate. Bailey 
concluded that forgetting rates for continuous process tasks were negligible while 
forgetting rates for procedural tasks were significant. He used bicycle riding as 
an example of a continuous process and operating a computer program as an 
example of a procedural or discrete interval task. In Bailey’s taxonomy, 
identifying thermal signatures of stationary armored vehicles from multiple two- 
dimensional angles would be considered a procedural task. 

Measures to gauge forgetting include recall and recognition (Lockhart, 
1992). Recall and recognition differ in that recall is retrieval without cues 
whereas recognition relies on assistance through various cuing techniques. 
Information retrieval using recall is more difficult than retrieval using recognition 
(Lockhart, 2000). The task of identifying a vehicle through thermal sights is 
based on a crewmember’s recognition of previously learned thermal patterns. 
Hagman et al. (1983) determined that critical skills are difficult for the Army as a 
whole to maintain due to prolonged time between training events. 

B. RESEARCH OBJECTIVES 

This study investigated how the passage of time and presentation of 
training material affected a subject’s ability to recognize military vehicles. The 
specific objectives of this research are: 
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• Analyze the accuracy of TVI task responses to determine whether 
there is a difference in performance based on testing interval. 

• Analyze TVI task performance to determine any difference in 
accuracy between two types of imagery: visual and thermal. 

• Assess which factors best predict performance decrement in a TVI 
task. 

C. RESEARCH QUESTIONS 

The research questions addressed by this study are; 

• What factors are the best predictors of decrements in training 
proficiency in a threat vehicle identification task? 

• Does the presentation of images (thermal or visual) influence an 
individual’s ability to recognize a military vehicle? 

• Does an individual’s memory ability impact decay rate? 

D. HUMAN SYSTEMS INTEGRATION 

The Manpower and Personnel Integration (MANPRINT) Program is the 
U.S. Army’s execution of Department of Defense’s (DOD’s) Human Systems 
Integration (HSI). MANPRINT is “comprehensive management and technical 
program that focuses on the integration of human considerations into the system 
acquisition process” (Manpower and Personnel Integration [MANPRINT], 2005, 
p. 1). The program was started in order to place an emphasis on the human in a 
system’s total performance. The goal of the MANPRINT program is to enhance 
soldier-system design, reduce life-cycle costs, and optimize total system 
performance (MANPRINT, 2005). MANPRINT is divided into seven HSI domains 
that work towards achieving this goal: 

• Manpower 

• Personnel 

• Training 

• Human Factors Engineering 

• System Safety 
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• Health Hazards 

• Soldier Survivability 

These domains interact with each other to varying degrees, depending on 
the system. Each domain has cost implications and potential cost-savings 
associated with them (MANPRINT, 2005). Changes in a domain will affect its 
cost and the costs of other domains that are related to a system. This creates a 
“trade space,” or the range between objective and threshold values that a 
program manager (PM) can manipulate. Managing the trade space among the 
HSI domains helps to effectively balance cost, schedule, and total system 
performance. The present research will focus on three HSI domains: personnel, 
training, and soldier safety. 

Personnel: MANPRINT defines the personnel domain to be the cognitive 
and physical capabilities that are required to be able to train for, operate, 
maintain, and sustain materiel and information systems (MANPRINT, 2005). 
These capabilities are normally reflected as knowledge, skills, abilities, and other 
characteristics (KSAOs). The Army seeks to place soldiers in military 
occupational specialties (MOS) by identifying target audiences for a system and 
comparing cognitive and physical demands with projected personnel inventories. 

A personnel attribute examined in this study is memory. Personnel 
selection criteria based on ability to be trained and process information are 
critical for mechanized crewmen who must remember characteristics of 
numerous vehicles in an operational environment in order to make the right 
decision. The Global War on Terrorism fratricide data show that most causal 
factors relating to friendly fire incidents were cognitive, perceptual, and psycho- 
behavioral factors of individuals (Webb & Hewett, 2010). This emphasizes the 
need to properly fill duty positions with the right mixture of personnel capable of 
remembering TVI training. 

Training: Training is the instruction, education, on-the-job, or self¬ 
development training required providing all personnel and units with essential job 

skills, and knowledge. Training addresses the gap that exists between a target 
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audience’s existing level of knowledge and the knowledge required to effectively 
operate and support a system (MANPRINT, 2005). 

Sustaining the baseline training proficiency that personnel achieve during 
initial training could reduce training costs significantly. Also, sustainment training 
may reduce the need for time-consuming remedial training and provide more 
stable performance levels throughout a soldier’s time in service. Training is more 
important in preventing fratricide than technological countermeasures (Webb & 
Hewett, 2010). This research will examine TVI training presentation in order to 
assess performance decrement over time. 

Soldier Survivability; Soldier survivability addresses the characteristics 
of a system that can reduce fratricide, detectability, and probability of being 
attacked, as well as minimize system damage, soldier injury, and cognitive and 
physical fatigue (MANPRINT, 2005). The U.S. Army added soldier survivability 
to its MANPRINT domains to focus attention on those aspects of a system that 
that can minimize the loss of forces due to friendly fire. 

Mitigating the effects of decay on a task such as TVI has the great 
potential to decrease fratricide. Engagements happen in a manner of seconds. 
Therefore, providing soldiers with a framework to accurately retrieve learned 
information can aid in the shoot/don’t shoot decision-making process. 

E. THESIS ORGANIZATION 

This thesis is divided into six chapters. Chapter II reviews literature 
regarding forgetting theories and models, military skill retention, the effect of 
fatigue on memory, and TVI training development. Chapter III describes the 
methods used in the experiment and the research design of the study. Chapter 
IV presents the results of the experiment. Chapter V discusses the analysis of 
the results. Chapter VI provides conclusions and recommendations for future 
research. 
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II. LITERATURE REVIEW 


A. BACKGROUND 

1. Decay Theory 

Factors related to the memory processes of encoding, storage, and 
retrieval help determine what is remembered or forgotten (Brewer, Zaho, 
Desmond, Glover, & Gabrieli, 1998). Brewer et al. (1998) examined the degree 
of activation in the hippocampal regions of the brain and found that the extent to 
which a visual experience is encoded predicts whether it will be remembered or 
forgotten. Sensory inputs are transformed into memories, but the stability of 
these memories depends on the quality of encoding (Brainerd, Reyna, Howe, & 
Kingma, 1990). Once a memory is encoded and stored in a memory trace, it is 
vulnerable to a gradual degradation due to decay. Decay can cause storage 
failures to occur and make remembering difficult (Brainerd et al., 1990). 

Brown (1958) and Peterson and Peterson (1959) conducted early studies 
examining decay theory. Brown (1958) proposed a decay theory hypothesis. 

When a sequence of items is presented, the interval between the 
perception of each item and the attempt to recall that item will 
depend on the length of the sequence. If the sequence exceeds a 
certain length, decay of the memory traces of some of the items 
will proceed too far for accurate recall of the sequence to be 
possible. This length is the memory span. Thus the trace- 
decay hypothesis can explain both the origin of the span and 
why forgetting occurs when the span is exceeded, (p. 13). 

Brown suggested that forgetting would not be expected if subjects were 
able to rehearse the presented material during the interval between presentation 
and the attempt to recall the item. Rehearsal increases the probability of 
counteracting the effects of decay (Brown, 1958). However, memory decay will 
occur when the memory trace is not activated. Brown also proposed that once a 
stimulus is perceived, it decays rapidly initially. The memory trace is still useable 
through recall and recognition until retention reaches a critical point where the 
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memory trace is no longer reliable. Beyond the laboratory, in many operational 
settings, rehearsal of information stored in memory does not happen at fixed 
intervals. Soldiers engaged in combat operations will experience periods of low 
and high operational tempo (OPTEMPO). During periods of high OPTEMPO, 
decay may occur because the time between information presentation and recall 
can exceed the memory span. 

Peterson and Peterson (1959) expanded on Brown’s work. Peterson and 
Peterson’s study examined forgetting of verbal stimuli at time intervals from three 
to 18 seconds. Their study included having participants count backwards 
between presentation of a verbal stimulus of three letters and then asking them 
to recall the letters. The researchers found evidence that memory decay became 
worse over longer time intervals, even when the effects of rehearsal were 
blocked. The relationship between recall and passage of time decayed 
exponentially and followed forgetting curve postulations (Ebbinghaus, 1913; 
Bailey, 1989). 

Signal detection theory is a viable tool to assess recognition memory 
(Wixted, 2007). Gold, Murray, Sekuler, Bennett, and Sekuler (2005) assessed 
memory decay within the scope of signal detection theory. Gold et al. proposed 
that memory decay was not just a function of acquiring more noise in the form of 
new memories. The new memories coupled with the passage of time took away 
from memory resources and allowed time to decay memory traces. Because of 
this. Gold et al. (2005) suggested that memory decays in a deterministic way and 
that future studies should seek to find the “forgetting function” that models how 
visual memory decays as a function of time. 

2. Recognition Memory 

Recognition memory involves selecting accurate responses from a choice 
of alternatives (Wisher, Sabol, & Ellis, 1999). This contrasts with recall memory 
where no alternatives are presented and a participant must produce an answer 
through more narrative methods. Recognition memory provides superior results 


10 



compared to recall memory due to the cues provided that drive recognition 
(Wisher et al., 1999). Recognition memory is dependent upon previous 
processing of the stimuli so that a judgment can be made about whether a 
previous encounter exists. The present study focused on analyzing recognition 
memory due to the visual cues presented in ROC-V. Participants did not freely 
recall any given name of a vehicle due to multiple-choice post-tests that provided 
an additional cue to supplement the visual cues. 

Warrington (1984) developed the Recognition Memory Test (RMT) to 
determine memory deficits across a range of factors and materials. The test 
provides the ability to compare and assess verbal and non-verbal memory. 
Recognition memory tests are common methods for measuring memory 
performance (Snodgrass & Corwin, 1988). Successful recognition memory is 
dependent upon the right storage and retrieval of memory information of a 
particular event (Tulving & Thompson, 1971). Retrieval cues used in recognition 
memory depends on how the item to be retrieved was encoded at perception 
since encoding shapes the memory trace process. The memory trace serves as 
the link between how well a memory was encoded and the extent to which 
retrieval cues are effective (Tulving & Thompson, 1973). The need to examine 
recognition memory through the RMT is necessary given that explicit semantic 
memories such as recognition memory are not as enduring as implicit memories 
(Woltz & Shute, 1995). 

Snodgrass and Corwin (1988) conducted an assessment of recognition 
memory tests used to examine memory deficits in cases of dementia and 
amnesia. The study found that as the number of similar features increases 
between test items and one’s internal representation of the test items, subjective 
familiarity with new and old test items increases. This phenomenon can create 
high levels of familiarity for different test items if stimuli are not properly encoded 
at the moment of perception (Snodgrass & Corwin, 1988). Misidentification from 
memory can occur from having the wrong memory representations activated and 
by confusion from prior test and distractor items. Therefore, proper encoding of 
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stimuli at perception is important to prevent grouping new images with previously 
learned and poorly encoded memories. 

Studies that examine recognition typically return high levels of accuracy 
below which performance cannot fall (Brainerd et al., 1990). Recognition 
memory tests also use more memorable testing materials in the form of photos 
(Brainerd et al., 1990). Recognition tests have also used short time intervals 
between training and testing from 1-3 days (Merriman, Azmita, & Perlmutter, 
1988). Forgetting does not begin immediately after being introduced to 
information. Memory normally improves for periods of a few hours to a few days. 
This phenomenon occurs in both children and adults. Therefore, in order to gain 
more insight in age differences with regard to levels of forgetting, memory tests 
should be administered at intervals beyond the common 1-3 days to account for 
the beginning of the forgetting period (Brainerd et al., 1990). 

3. Effects of Fatigue on Recognition Memory 

Jenkins and Dallenbach (1924) examined Ebbinghaus’ (1913) original 
study of forgetting. The purpose of the study was to compare forgetting rates 
after varying hours of sleep and wakefulness. Participants in the study learned 
nonsense syllables in a similar fashion to Ebbinghaus’ original study. 
Participants were tested at intervals after sleep and wakefulness. The 
participants remembered twice as many words after the sleep interval than the 
wakefulness interval. This early work by Jenkins and Dallenbach led to further 
studies examining the role of sleep and its effects on memory processes from 
encoding to retrieval. Walker and Stickgold (2006) found that the effects of sleep 
deprivation over a 36-hour period severely impaired encoding of memories. This 
resulted in significantly worse retention involving learning lists of words in the 
days after the initial training. 

Sleep is an important factor in the consolidation of memories 
(Drosopoulous, Wagner, and Born, 2005). Memory consolidation is the process 
by which new memories consolidate over time and become resistant to 
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interference without any form of rehearsal (McGaugh, 2000). Sleep also helps 
enhance memory consolidation after learning (Rasch & Born, 2008). Studies 
have hypothesized that the different stages of sleep affect learning and 
memories in different ways (Maquet, 2001; Stickgold, Hobson, Fosse, & Fosse, 
2001; Kami, Tanne, Rubenstein, Askenasy, & Sagi, 1994). Recent studies 
suggest that sleep on the night after a visual perceptual learning session is 
critical to satisfactory follow-on performance (Maquet, 2001). 

Sleep enhances explicit recognition memory (Drosopoulous et al., 2005). 
Drosopoulous et al. (2005) compared the effects of sleep on explicit and implicit 
memory formation for a word-discrimination task. Participants studied two lists of 
words before a three-hour retention interval of sleep or wakefulness. Participants 
then completed a recognition test during early parts of the sleep period where 
slow wave sleep (SWS) occurs and late at night towards the end of a sleep cycle 
when REM occurs. Drosopoulous et al. found that sleep during the SWS period 
significantly increased recognition for the list of words. 

Rapid eye movement (REM) sleep has an integral role in memory 
processing for procedural learning of a visual memory task (Stickgold et al., 
2001). Kami et al. (1994) found that performance of a visual discrimination task 
improved after participants were able to have a normal sleep cycle the night 
following training. Furthermore, human memory consolidation processes depend 
on REM sleep. 

4. Background Summary 

Previous studies suggest that memory decays at an exponential rate over 
time. This is important when analyzing military tasks that are dependent on 
memory retrieval. Consequences can be fatal when warfighters forget TVI 
training. Rehearsal improves performance on such tasks, but memory is still 
susceptible to decay. Recognition memory assesses the ability to choose among 
alternatives, whereas recall involves selecting a correct response without 
alternatives. Sensing and processing of visual cues are significant factors of 
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accurate recognition memory processing. Therefore, proper encoding and 
storage of visual stimuli in a TVI task are essential. Given that memory decays 
and that improper recognition memory processing put performance at risk, it is 
necessary to assess memory through a TVI task. Furthermore, sleep and fatigue 
are essential throughout the entire memory process. Adequate levels of rest 
strengthen memories and help make them less susceptible factors such as 
decay and interference. Memory and those factors that interact with it are key 
elements of training TVI. 

B. RECOGNITION OF COMBAT VEHICLES 

1. Evolution of Vehicle Identification Training 

According to the DOD’s 2009 Unmanned Systems Integration Roadmap, 
future systems must focus on specific warfighter capability needs. The second 
most important capability listed in the roadmap is target identification and 
designation (Department of Defense, 2009). The DOD highlights the need to 
precisely obtain positive military target identification in order to achieve faster and 
more accurate effects on the battlefield. This issue is not new, however. 
Identification of military vehicles has a long history in the U.S. military. 
Recognition of combat vehicles became a pressing issue when the first tank-on- 
tank battle occurred during World War I (Briggs & Goldberg, 1995). The issue of 
vehicle recognition extended beyond armor crewmen and reached to dismounted 
soldiers and aviators who also had to identify friend or foe on the battlefield. The 
sum of these efforts on the ground and in the air helped achieve tactical success. 
Gibson (1947) published a description of a research program used during World 
War II to improve aircraft identification and the general education of visual 
learning. Gibson described the aim of training aircraft recognition as two-fold: 
identification of friend or foe and range determination based on the size of the 
aircraft. Flight students who were taught to identify general forms of aircraft 
versus distinguishable characteristics of aircraft performed poorly on recognition 
tasks. The flight students repeatedly asked how they could distinguish between 


14 



two planes that closely resembled each other. Gibson found that these results 
necessitated a paradigm shift in training to move beyond a general total form 
approach of identification to an approach that included features of vehicles that 
distinguish them from others. Gibson developed a still and motion picture 
training series for the Army Air Force to train pilots and aircrews in recognizing 
friendly and enemy aircraft. 

In the 1980s, the U.S. Army developed a basic combat vehicle 
identification (CVI) program (Warnick & Smith, 1989). The training program 
included instruction for both training and operational units. The program 
consisted of visual and thermal vehicle identification using 35mm slides. Combat 
identification flash cards supplemented the training slides (Warnick & Smith, 
1989). Thermal images were available on a limited basis. Thermal imagery was 
needed because the M60A3 tank was equipped with infrared sights and the new 
Ml tank possessed an advanced Thermal Imaging System (TIS). Early iterations 
of thermal images had to be simulated and consequently had low fidelity. This 
program was the Army’s first attempt at training its soldiers on thermal imagery 
(Smith, Shope, Heuckeroth, Warnick, & Essig, 1987). However, these solutions 
were interim and partial (Dyer, Westergren, Shorter, & Brown, 1997). 

Training and testing of CVI consisted only of photographs and two- 
dimensional line drawings (O’Connor & O’Kane, 2000). In 1987, the Army 
supplemented its CVI training program with Graphic Training Aid (GTA) 17-02-13 
(Department of the Army, 1987). GTA 17-02-13 is a deck of 50 cards available 
to soldiers as a self-study tool. The two-sided cards provide common views of a 
vehicle on one side with vehicle descriptions on the other (Figure 2). This 
approach allowed soldiers to train on their own without having to use robust 
resources to facilitate their learning. The low quality of the images and 
instruction provided on the cards led to the need for an improved training 
approach to vehicle identification. 
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Figure 2. GTA for the M2/M3 Bradley 

(from Department of the Army, 1987) 


The U.S. Army tested a thermal training program, CVIPLL/S in 1997. The 
program used actual thermal imagery and set out to field a generic thermal 
training program for all of the Army’s thermal sights (Dyer et al., 1997). 
CVIP/.L/S allowed trainees to progress through the program at their own pace. 
This process was supported by previous research (Gibson, 1947; Warnick & 
Smith, 1989) that showed significant differences in individual ability to develop 
recognition skills. The CVIP/us framework became the foundation of ROC-V. 
The U.S. Army’s Communications-Electronics Research, Development and 
Engineering Center’s Night Vision and Electronic Sensors Directorate (CERDEC 
NVESD) developed ROC-V and it was fielded as a computer-based training 
system in 1999. ROC-V presents vehicle signatures through a variety of 
simulated sights that are currently in use in the United States (Boyd et al., 2005). 
CERDEC NVESD later developed ROC-V into a mobile application that was 
available to Android and Apple iOS users in 2013 (Rominiecki, 2013). This 
capability enabled soldiers to train on vehicle recognition on their mobile devices 
and on their own time, much like the GTA cards. However, with the mobile 
version of ROC-V, soldiers also had access to thermal signatures of vehicles. 

2. Training Visible and Thermal Images 

Recognizing vehicles is a difficult task to train and sustain (Dyer, Shorter, 

& Westergren, 1998). Recognizing thermal images of vehicles is even more 

difficult. Thermal image signatures differ from visible images of a vehicle’s 
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superficial surface signatures. Thermal signatures are dynamic and are the 
function of radiation from vehicle heat sources, frictional sources, weapons 
systems, and solar sources during daylight hours (O’Connor & O’Kane, 2000). 
Further, operators manipulate brightness and contrast settings on sights that 
leave no two thermal images exactly alike (Dyer et al., 1997). The differences 
between thermal and visible signatures can cause confusion when trying to 
identify a vehicle, even though the physical surface structure of a vehicle may be 
known to the warfighter. O’Connor and O’Kane (2000) concluded that familiarity 
with vehicles in two-dimensional line drawings and photos without knowledge of 
thermal signatures was not sufficient to effectively complete combat vehicle 
identification with thermal sights. 

While multiple types of sensors and sights exist today, the common 
technology used in most modern weapons systems is forward-looking infrared 
(FLIR, also known as thermal). Thermal sensors transform infrared energy 
produced by heat into visible light (Palmer, D’Agostino, & Lillie, 1982). This 
transformation presents a new stimulus to an operator that is different than 
visible images. This phenomenon necessitates thermal imagery training since 
high performance on visible image identification does not guarantee the same 
level of expertise on FLIR identification (Dyer et al., 1997). Another important 
issue for thermal image training is the need to use non-daylight thermal images. 
Environmental heating and cooling can change vehicle signatures enough to 
appear different to a warfighter (Dyer et al., 1998). 

Thermal signature training has a theoretical basis derived from perceptual 
tasks. Biederman and Shiffrar (1987) suggested that with perceptual-learning 
tasks, basic instruction should include location and shape of critical features that 
can help increase recognition performance. Their research focused on sexing of 
day-old chicks with novice observers and experts. Within an hour of training 
using the specific recognition strategy, the novices were able to determine the 
sex of chicks at the same level of accuracy as the experts. 
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Knowledge of a vehicle’s critical features is necessary for proper 
identification, according to a 1997 study by O’Kane, Biederman, Cooper, and 
Nystrom. The results of their study indicated that confusing two objects could be 
predicted based on the number of similar major characteristics shared by the 
vehicles. Two vehicles that share three similar major characteristics would not 
be as easily confused as vehicles that shared five characteristics. Warfighters 
must be able to observe and process critical cues that are visible in a target 
picture. These cues include a vehicle’s hull, armament, turret, and suspension 
(Warnick & Smith, 1989). Time pressure further stresses a warfighter’s decision 
to shoot because an engagement between two vehicles can be complete within 
10 seconds (Briggs & Goldberg, 1995). 

Biederman (1987) proposed a recognition-by-components theory that 
suggested an object could be identified by breaking it into major parts. This 
approach follows the training methodology used to evolve CVI and ROC-V, which 
focused vehicle identification on distinguishing features. Dyer et al. (1997) 
determined that if thermal signatures can be described in verbal terms, then 
training approaches should use verbal instruction. However, if thermal 
signatures are too vague to be described by verbal instruction, then perceptual 
learning should be stressed in the training scenario. 

The order of the presentation of thermal and visible image sets is not a 
critical factor in determining a program of instruction (Smith et al., 1987). 
However, the presentation of image aspects is a critical factor in both visible and 
thermal vehicle identification. Dyer et al. (1998) recommended a minimum of 
eight aspect angles to be presented in training sets. ROC-V presents eight 
aspect angles in both its visible and thermal training modules. Soldiers perform 
at higher levels on a TVI task when they have practiced recognizing a vehicle’s 
identity and aspect (Dyer et al., 1997). 

Simulation-based military training can overwhelm a learner if too much 
information is presented (Andrews & Bell, 2000). During the evaluation of 

computer-based training on vehicle identification. Dyer et al. (1997) found that 
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too many vehicles with multiple aspect images increased training time and 
soldiers did not reach specified performance standards easily. The results of the 
computer-based training assessment for vehicle identification recommended that 
no more than six vehicles should be presented in a training set (Dyer & Salter, 
2001). Six vehicles with eight aspect angles yields 48 images as opposed an 
original proposal of eight images with eight aspect angles (64 total images). 

3. Vehicle Identification Skiii Retention 

Dyer et al. (1997) proposed that future research building on CVIPLL/S 
should examine how quickly soldiers lose their vehicle identification skills and what 
factors affect forgetting rates. The need to address these issues is intuitive, as 
commanders need to know the current skill level of their soldiers and how to 
develop training schedules around their performance. However, research efforts 
have been limited to address these issues. The current study addresses the effect 
of time and image presentation factors on performance decrement for the TVI task. 

Heuckeroth, Smith, and Shope (1988) tested four groups of 45 soldiers at 
Fort Hood, Texas, using visible images available in the Army’s CVI training 
program. Their research had two findings relevant to the current research: 1) the 
greatest decrement in TVI performance occurred within a three-week interval 
immediately following training; and 2) different soldiers reach specific levels of 
performance initially, but performance on the TVI task decays at generally the 
same rate for all. Heuckeroth et al. (1988) suggested that performance 
decrement between weeks three and nine was minimal. Therefore, they 
recommended that retraining efforts be started after nine weeks in the interest of 
managing training resources. 

General Technical (GT) scores on the Armed Services Vocational Aptitude 
Battery (ASVAB) are highly correlated with performance on a TVI task (Shope et 
al, 1988; Warnick & Smith, 1989). These scores were not collected in the 
present effort, as commissioned officers are not required to take the ASVAB and 
thus obtain a GT score. However, soldiers with GT scores over 100 and those 
with less than 99 showed a significant difference in performance of basic vehicle 
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identification (Warnick & Smith, 1989). This difference in performance difference 
suggests that training programs should allow for more training time for personnel 
with lower GT scores. Another consideration of GT scores is that a possible 
relationship exists with long-term memory and the implication for how frequently 
training should be repeated. 

4. Recognition of Combat Vehicies Summary 

Vehicle identification training has evolved tremendously over the last 70 
years. However, thermal imagery is still relatively new and unfamiliar to 
warfighters. These factors necessitate added attention to training thermal 
imagery. While previous studies indicate that ASVAB scores are predictors of 
performance on a TVI task, studies that examine thermal imagery are limited. 
While the military identified the need to create a more robust training platform for 
thermal images, analysis of the differences in performance between thermal and 
visible images are needed to better understand thermal TVI skills. 

C. HYPOTHESES 

The literature review has identified several factors that relate to 
performance decrements in a TVI task. The issues reviewed in this chapter have 
been focused on addressing two of the most relevant factors in this study’s 
objectives: time between training and image presentation. Two of the most 
relevant factors to this study’s objectives are time between training episodes and 
image presentation. Recognition memory capability is used as a control variable 
across factors. 

The alternative hypotheses derived from the research questions are: 

• Hai: Performance on a TVI task without rehearsal degrades at an 
exponential rate over time. 

• Ha 2 : Performance on a TVI task is different between visible and 
thermal image presentation. 

• Has: Recognition memory capability predicts performance on a TVI 
task. 
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III. METHOD 


A. OVERVIEW 

A controlled laboratory experiment was conducted to assess memory 
decay after completing TVI training using the ROC-V application. This study was 
a 2 X 5 mixed design that compared recognition of vehicle images (thermal or 
visible) at different time intervals (1, 2, 4, 8, and 16 days). Participants were 
randomly assigned to one of five groups that corresponded to one of the time 
intervals. Prior to the TVI training, participants completed a recognition memory 
test to determine individual memory capabilities. Participants completed a sleep 
log for the duration of their involvement in the experiment. Table 1 highlights the 
experimental design for this study. 



TIME SINCE TRAII 

NING 

1 DAY 

2 DAYS 

4 DAYS 

8 DAYS 

16 DAYS 

VEHICLE 

IMAGES 

(THERMAL 

AND 

VISIBLE) 

GROUP 1 






GROUP 2 






GROUP 3 






GROUP 4 






GROUP 5 


Table 1. Experimental Design. 


The study consisted of an initial recognition memory test (RMT) on one 
day, followed by a training day 72 hours later, and a final testing day. The initial 
recognition memory test day consisted of taking the RMT until participants 
achieved a 100% score. The second day consisted of completing the RMT and 
the ROC-V training thermal and visible modules. The ROC-V training modules 
included a pretest, a training presentation on vehicle identification basics and 
training on identifying the vehicles in the training sets. Multiple check-on-learning 
quizzes and a final post-test accompanied each vehicle training set. This training 
sequence was the same for the thermal and visible image modules. Participants 
trained until they achieved a 90% or higher score on the vehicle post-tests. 
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Participants repeated the thermal and visible post-tests and completed another 
iteration of the RMT on their group’s second day. 

B. PARTICIPANTS 

1. Selection 

The Naval Postgraduate School Institutional Review Board reviewed and 
approved the research methods used in this study. Participants were treated in 
accordance with Department of the Navy and American Psychological 
Association (APA) ethical standards. All participants were informed of their rights 
as subjects in the study and signed informed consent forms. Participants were 
recruited through personal communication and email. The exclusion criterion for 
the study was any service-certified tank or Bradley master gunner students. 
Master gunners are considered experts in vehicle identification and are certified 
to train and certify TVI test results. 

2. Demographics 

Thirty participants completed the study (average age = 32.1, standard 
deviation [SD] = 4.56). Participants included 29 male officers and one female 
officer from the U.S. Army, Navy, Marine Corps, and Air Force. The participants’ 
specialty branches within their respective services included combat arms, 
combat support, and combat service support. All participants were graduate 
students at the Naval Postgraduate School. The pay grades of the participants 
included 21 0-3s, 8 0-4s, and 1 0-5. The participants’ time in service ranged 
from 5 years to 25 years (average time in service = 11.1, SD= 5.07). Participants 
had no prior certification as an Ml tank or Bradley Fighting Vehicle master 
gunner. Twenty-eight participants reported having visual acuity at or correctable 
to 20/20, while two participants reported having visual acuities of 20/30 and 
20/40. Participants reported any previous TVI training and the time since the last 
training session. Eleven of the 30 participants had previous TVI training within 
the last 11 years (average time since training = 5.55, SD= 3.64). 
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c. 


MATERIALS 


1. Equipment 

The experiment used a color projector to display the recognition memory 
test to groups of participants for the initial RMT. The experiment used 
workstations for running the ROC-V program. The workstations were comprised 
of a Dell computer, one flat-panel monitor, and audio headsets (see Figure 3). 



Figure 3. Participant at ROC-V Workstation 


The exact specifications for the equipment used in the study were: 

• 1 X Dell Optiplex 380 desktop, running Windows 7, with an Intel 
Core Duo 2 processor, 2.93 GHz, 2.0 GB RAM 

• 1 X Dell 2007FP Ultrasharp 20” LCD flat panel display 

• 1 X Audiovox stereo headphones 

• 1 X InFocus IN2102 EP DLP Projector 
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• 1 X Apple MacBook Pro laptop, running OS X 10.9.2, with an Intel 
Core i5 processor, 2.4 GHz, 4.0 GB RAM 

• Microsoft PowerPoint for Mac 2011, version 14.3.9 

• Recognition of Combat Vehicles, version 10.1.1.19 

D. VARIABLES 

1. Independent Variables 

a. Time since Training 

The time between initial training and second testing varied between the 
groups. Training intervals occurred at 1, 2, 4, 8, and 16 days. Each group was 
randomly assigned to one of these days. These intervals were chosen based on 
the binary algorithm used by Bailey (1989). 

b. Vehicie image Presentation 

Ten visible and ten thermal two-dimensional images were used for training 
and testing. Twenty separate vehicles were used to build the two test sets 
(Table 2). Vehicle images were pre-loaded in the ROC-V program. All 

participants trained on all vehicles in both sets. 


Thermal Images 

Visible Images 

Ml Abrams 

M2A2/M3A2 

Bradley 

2S1 Gvozdika 

AAV7 

T-72 

BTR-70 

Leopard II 

Stryker MGS 

BMP-1 

ZSU-23-4 

Shilka 

M997 HEMTT 

FV4034 
Challenger 2 

LAV-25 

M113 

M557/1068 

CPV 

M1083A1 

FMTV 

M551 

Sheridan 

M60A3 Patton 

BRDM-2 

SCUD B 
(9K72 Elbrus) 


Table 2. Test Vehicle Sets 
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2. Dependent Variable 

a. ROC-V Post-Test Accuracy 

The ROC-V program provided feedback from the post-test that included 
the final total of correct vehicles and accuracy percentage. These results were 
used to determine any changes from initial proficiency and differences between 
the image conditions. 

3. Covariate Variable 

a. Sleep 

Sleep influences an individual’s ability to properly encode and store 
information in long-term memory. Participants were asked to complete a sleep 
log for the duration of their involvement in the study to determine any effects or 
interactions with their performance. Participants who completed their second test 
at the one and two day training intervals provided information on their last 72 
hours of sleep. 

4. Control Variable 

a. Recognition Memory 

Due to the study’s objective of assessing performance based on a 
recognition task, determining individual memory levels was necessary. A 
recognition memory test provided an assessment of memory retention 
(Warrington, 1984). The RMT used in this study was based on Warrington’s 
(1984) framework. The test was developed using random images available from 
Microsoft PowerPoint for Mac 2011. The RMT (see Appendix A) consisted of 12 
images presented on individual slides to the participants on a projector. After all 
12 images were presented, participants would see an array of 36 images 
randomly placed on one slide. The original 12 images were included in this 
array. Participants would then write down the images that they recognized from 
the original set of 12. Participants would complete the RMT until they were able 
to correctly identify all 12 of the original pictures. Participants would then be 
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presented with the array of 36 images 72 hours later and on their second test day 
to determine their level of performance decrement with the RMT. Participants 
were not aware that they would take the RMT a second and third time. 

E. PROCEDURE 

Participants signed up for an initial RMT and then gathered in the Human 
Systems Integration Laboratory (HSIL). The researcher introduced the 
participants to the study. Participants completed an informed consent form 
followed by a demographic questionnaire (see Appendix B). Participants 
completed the initial RMT and then received an email with the time for their ROC- 
V training 72 hours later. 

Participants completed the RMT first by watching a timed presentation of 
the 12 images followed by the array of 36 random images. Participants were 
required to correctly remember the 12 images presented in the RMT. 
Participants completed the RMT until they had achieved this mark. The 
researcher recorded the number of trials that it took a participant to identify all 12 
of the images in the RMT. After 72 hours had elapsed, participants returned to 
the HSIL where they were presented with the array of 36 images from the RMT. 
Participants were instructed to write down the images they remembered. At the 
completion of the initial training and testing day, the researcher randomly 
assigned participants to one of the five groups. No significant differences 
between the initial RMT and RMT that occurred 72 hours later led to the 
assumption that memory ability variability was not a potential confound. 

Participants were then oriented to their computer workstation and given a 
set of instructions with screen shots of ROC-V to assist them with completing the 
training modules. Participants then opened the ROC-V program and entered an 
assigned participant number. Participants completed a ten-question pretest with 
randomly generated vehicles within the thermal image set. Participants were 
instructed to complete the first vehicle identification module. The first module 
consisted of thermal images. The self-guided training sequence started with an 
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introduction to basic vehicle identification techniques. Participants then 
completed the self-guided training module for the vehicle set. A check-on- 
learning quiz was administered after the second vehicle and every subsequent 
vehicle. The quiz presented a vehicle name and two vehicle images or one 
vehicle image with up to five choices of vehicle names. Participants had to 
match the correct vehicle with the displayed name. Participants had to score 
higher than 90% during the check-on-learning quizzes in order to progress to the 
next vehicle in the set. Participants then completed a 10-question post-test for 
the thermal image set. Ten vehicles were randomly generated for the post-test 
by ROC-V for every testing sequence. The post-test assessed participants’ 
ability to identify combat vehicles in thermal images. Participants were shown a 
target image and given 10 seconds to select the answer from a set of five 
choices. Participants had to correctly identify 90% of the vehicles in the post-test 
to be considered trained for the study. Participants who did not meet the 90% 
criteria on the post-test were allowed to retrain at their discretion and then 
repeated the test until they reached the passing standard. Participants then 
completed the visible vehicle image set in the same training sequence as the 
thermal set. Once participants had reached 90% proficiency on the visible 
vehicle set, the researcher provided them with a sleep log (see Appendix C) and 
instructions on how to complete it. Participants agreed to not conduct any 
supplementary training on vehicle identification or to discuss the experiment until 
all second testing iterations were complete. 

Participants returned to the HSIL on their second assigned day to 
complete both the visible and thermal post-tests again. Participants were shown 
the 36-image array from the RMT and asked to write down the images that they 
remembered from the initial test before taking the ROC-V post-tests. Participants 
used the same workstations with ROC-V to complete the post-test. Participants 
also turned in their completed sleep logs. The researcher interviewed the 
participants to solicit feedback on the study and to determine whether any 
extraneous life events might have impacted their performance in the study. 
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IV. RESULTS 


A. OVERVIEW 

The present study collected ROC-V test data, recognition memory test 
data, participant sleep data, and demographic data during a threat vehicle 
identification task. Analysis was conducted to determine the effect of the 
passage of time (1, 2, 4, 8, and 16 days) on vehicle identification to address 
Hypothesis One. Analysis was also conducted to determine the effect of thermal 
and visible image presentation on TVI test scores to address Hypothesis Two. 
Further analysis was conducted to assess recognition memory in predicting 
performance on a TVI task to address Hypothesis Three. An alpha level of 0.05 
was used for all statistical tests. There were no significant findings with the 
visible image condition across all factors. 

B. GROUP ASSIGNMENTS 

Results from the second RMT that participants took 72 hours after the 
initial battery did not produce significantly different results to establish sub¬ 
groups. No participants met the exclusion criterion of being a tank or Bradley 
master gunner. An analysis was conducted to determine if participants with 
previous TVI training were randomly assigned to one of the five groups (see 
Table 3). A significant difference was not detected using Fisher’s Exact Test 
(p=0.61). Therefore, the assumption was made that participants were randomly 
assigned to address the potential confounding variable of previous training in the 
following analysis. 



GROUP 1 

1 DAY 

GROUP 2 

2 DAYS 

GROUP 3 
4 DAYS 

GROUP 4 
8 DAYS 

GROUP 5 
16 DAYS 

PREVIOUS 

TVI 

TRAINING 

Yes 

5 

4 

4 

4 

2 

No 

1 

2 

2 

2 

4 


Table 3. Previous TVI Training by Group. 
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c. 


ROC-V POST-TEST ACCURACY 


All participants completed a thermal and visible TVI post-test on their 
assigned day. Scores were collected that gave the number of correct responses 
for the two tests. Figure 4 shows the number of correct responses for all 
participants by their assigned groups and the average number of correct 
responses across all five groups (M=7.60, SD=1.65) for the thermal post-test. 
Figure 5 depicts the results and the average number correct for all five groups 
(M=9.33, SD=0.80) for the visible post-test. 
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Figure 4. Thermal Test Scores by Group 
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Figure 5. Visible Test Scores by Group 


To determine the effect of time on TVI post-test accuracy, a Fisher’s exact 

test for count data was conducted for the thermal and visible post-tests. A 
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significant difference was not found between groups for the thermal condition 
(p=0.97) or the visible condition (p=0.26). 


D. IMAGE PRESENTATION 

Thermal images present different cues than visible images. A Wilcoxon 
signed rank test was conducted to determine the impact of image presentation 
on the TVI post-tests. The Wilcoxon signed ranks test indicated that thermal 
image scores were significantly lower than visible test scores within all groups, 
Z=120.5, p<0.0001. Figure 6 shows the differences between each group’s mean 
scores on the two post-tests. 

Me an(Visible Test) & Mean(Thermal Test) vs. Days since Training ■Mean(Visibie Test) 

“1 I ^■Mean(Thermal Test) 



1 2 4 B 16 


Days since Training 

Figure 6. Mean Post-Test Scores by Group 
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E. COVARIANCE 

1. Recognition Memory Test 

An over-dispersed logistic regression model was fitted to determine the 
effect of recognition memory using the 72-hour RMT on thermal post-test 
performance. The relationship between the variables was significant (p=0.038), 
indicating that performance on the 72-hour RMT predicted post-test performance. 

2. Previous TVI Training 

Participants were asked if they had any previous TVI training. Eleven 
participants had previous TVI training, ranging from two to eleven years ago. 
Previous TVI training was assumed to have a binomial distribution and was used 
as a covariate during data analysis to account for previous training and on 
performance on the thermal post-test. The results from fitting a logistic 
regression model (see Figure 7, where x=1 if a participant had prior training and 
x=0 otherwise) indicate that if a participant had prior TVI training, the estimated 
proportion correct on the thermal post-test increased (p=0.035). 

A 2.19+.760X 

_ ^ 

P ~ 77 2.19+.760X 

\ + e 

Figure 7. Previous Training Logistic Regression Equation 

3. Time to Complete Training 

Participants were timed from the beginning of their ROC-V training 
sessions until the completion of their baseline ROC-V post-tests. A logistic 
regression model was fitted to analyze the relationship between time to complete 
training (see Figure 8, where x represents time to complete training) and 
performance on the thermal post-test. 
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2.83+.025x 


r , 2.83+.025X 

1 + e 

Figure 8. Time to Complete Training Logistic Regression Equation 

The results indicate that time to complete training is a significant predictor 
of thermal post-test scores (p=0.03). As time to complete the training session 
increased, performance on the thermal post-test decreased (see Figure 9). 


Pred Thermal Test & 2 more vs. Time to complete training 



• Pted Thermal Tost 

• Upper 95% Mean Thermal Test 

• Lower 95% Mean Thermal Test 
Smooth(Pred Themnal Test) 

- SmoothlUpper 95% Mean Thermal Test) 
SmoothfLower 95% Mean Thermal Test) 


Figure 9. Regression Plot for Time to Complete Training 


4. Sleep 

Participants completed sleep logs for their duration in the study. Average 
sleep times were computed for each participant and used as a covariate during 
data analysis to account for the effects of fatigue on participants’ recognition 
memory. The results of fitting a generalized linear model were not significant and 
indicated that sleep was not a significant predictor for the post-test scores or 
recognition memory test scores (p=0.501). 
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V. DISCUSSION 


A. TIME SINCE TRAINING (HYPOTHESIS 1) 

The hypothesis that performance on a TVI task degrades at an 
exponential rate over time was not supported. Statistical tests for both types of 
imagery and TVI performance retained the null hypothesis that performance on 
the TVI task does not degrade exponentially. This suggests that memory is not 
as susceptible to decay on a TVI task. 

1. Results Review 

This study examined the rates of forgetting on a TVI task over a 16-day 
period. Thermal and visible image sets were used to measure performance on a 
TVI task. An analysis of variance (ANOVA) was conducted on each group’s 
post-test scores for each image condition. Results showed that the passage of 
time since initial training was not significant between groups or image conditions. 
The independent variables of time since training and image presentation did not 
produce any significant interactions. 

2. Relevance of Results to Previous Research 

Dyer et al. (1997) proposed that future research should examine rates of 
forgetting and associated factors on a TVI task since research in this area was 
limited. Heuckerorth et al. (1988) determined that the greatest decrement in 
performance on a TVI task using visible imagery occurred within a three-week 
period following initial training. Based on this previous study, we predicted that 
performance would degrade significantly over the 16-day test period for both 
image conditions. The work of Ebbinghaus (1913) and Peterson and Peterson 
(1959) also led us to predict exponential rates of forgetting during the present 
study. The present study did not find forgetting to occur at these previously 
identified rates. This may be attributed to the difference between recall tasks 
completed in previous studies and the recognition task completed in ROC-V. 
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The battlefield task of TVI is representative of a cued recall task, where 
warfighters must recall a vehicle’s characteristics by seeing visual cues. 
However, the ROC-V post-test presents a recognition task where participants 
select from a choice of alternatives after the program presents visual cues. The 
difference between the task in the present study and the work of Ebbinghaus 
(1913) and Peterson and Peterson (1959) may account for the difference in the 
results not showing exponential decay on the ROC-V post-test. 

O’Connor (2002) found that the use of ROC-V helps to maintain TVI task 
proficiency above 90% with sustained usage. The results from the ANOVA 
indicate that ROC-V is indeed capable of achieving high levels of proficiency on a 
TVI task. However, the test metrics that ROC-V uses to measure performance 
are not representative of an actual TVI task since ROC-V uses a multiple-choice 
post-test. This sort of cueing does not exist for a warfighter who must remember 
vehicle characteristics without being presented response options. 

3. Explanation of Non-significance 

The lack of significance suggests that performance on a TVI task may not 
decay exponentially as predicted for either thermal or visible images. However, 
the battery used to test participants is not representative of a real-world TVI task. 
For instance, there is no menu of options that appears in a tank gunner’s primary 
sight that relays vehicle information once a target has been identified. 
Warfighters must be able to detect, identify, and classify images as friend or foe 
using their knowledge of the battlefield-operating picture and their previous 
training on TVI. Participants in the present study reported that they were able to 
deduce which vehicle was the right answer by process of elimination. The 
choices for each question were limited to the vehicles in the 10-vehicle training 
set. This provided further assistance in answering the post-test questions since 
participants were familiar with the vehicle choices and had no unfamiliar vehicle 
names to consider. 
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B. THERMAL VERSUS VISIBLE IMAGE PRESENTATION 

(HYPOTHESIS 2) 

The hypothesis that performance on a TVI task is different between visible 
and thermal image presentation was supported. Statistical tests for both types of 
imagery and TVI performance rejected the null hypothesis that performance on 
the TVI task does not differ between image presentations. The implications for 
this finding are important for both training plan design and fratricide mitigation. 

1. Results Review 

Participants’ performance was measured by analyzing the number of 
correct responses for the thermal and visible post-tests. A Wilcoxon signed rank 
test indicated that the visible image test led to significantly higher scores on the 
visible post-test than the thermal image post-test. 

2. Relevance of Results to Previous Research 

Dyer et al. (1998) reported that thermal image recognition was more 
difficult than visible image recognition. Results from the present study support 
this finding. Even though the multiple-choice post-tests allowed participants to 
deduce their answers, a significant difference between thermal and visible image 
conditions still existed. Dyer et al. (1997) suggested that differences in thermal 
and visible image presentation could lead to confusion when conducting a TVI 
task, even when the physical surface structure of the vehicle may be known to a 
warfighter. Some participants reported seeing some of the vehicles while 
stationed at previous duty stations. While participants may have had limited 
knowledge of some of the vehicles, their performance indicated that the thermal 
condition was still more difficult than the visible condition. 

The differences in thermal and visible images are informative. Thermal 
signatures are dynamic and change based on environmental and system 
operating conditions. Visible images do not present as many challenges to an 
observer (O’Connor & O’Kane, 2002). Dyer et al. (1997) proposed that high- 

performance on a visible TVI task was not predictive of the same performance on 
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a thermal TVI task. Results from the present study support this assertion since 
visible post-test scores were significantly higher than the thermal condition. All 
participants reported more difficulty with the thermal images due to their 
unfamiliarity with seeing infrared images. Participants reported that seeing 
thermal images was more difficult since they had to focus on specific cues in the 
images as opposed to being able to plainly see more vehicle characteristics in 
the visible condition. This follows the work of Palmer et al. (1982) who 
suggested that thermal sensors present a new and unfamiliar stimulus to an 
operator. 

3. Explanation of Significance 

Thermal imagery is a relatively new technology available to warfighters. 
As the United States began fielding systems with more robust infrared 
technology in the 1980s, the introduction of thermal images created a new 
challenge for warfighters since they had not previously been exposed to this sort 
of stimulus (Smith et al., 1987). Even though they knew some of the vehicles 
from previous work experience, participants reported being unfamiliar with seeing 
thermal images of vehicles. This led to confusion when taking the post-tests and 
lower scores on the thermal image set. 

Lower performance on the thermal condition matters. In Operation Desert 
Storm Given, 90% of engagements occurred using thermal sights (O’Connor & 
O’Kane, 2000) and 80% of Ml Abrams and M2 Bradley Fighting Vehicles 
destroyed were the result of friendly fire (Harmeyer & Antal, 1992). More 
recently, 55 fratricide instances led to 30 fatalities in the GWOT from September 
2001 to March 2008 (Webb & Hewett, 2010). These statistics along with 
performance on the thermal post-test in the present study highlight the need to 
effectively train and test warfighters with thermal imagery. 
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C. RECOGNITION MEMORY PREDICTING TVI PERFORMANCE 

(HYPOTHESIS 3) 

The hypothesis that recognition memory predicts performance on a TVI 
task was supported. Statistical tests for the 72-hour recognition memory test 
rejected the null hypothesis that recognition memory did not predict performance 
on the TVI task. This suggests that recognition memory ability is an important 
predictor of performance on a TVI task and should be considered in training 
design. 

1. Results Review 

Participants’ recognition memory was measured using a 12-image 
recognition memory test. Participants took the RMT on their first day in the study 
until they correctly remembered all 12 images. After 72 hours had elapsed, 
participants returned and took the RMT again. The effect of recognition memory 
on the thermal post-test was determined by using an over-dispersed logistic 
regression model. Results indicate that the 72-hour RMT scores were significant 
predictors of performance on the TVI task. Participants who did not remember all 
12 images on the RMT did worse on the thermal post-test than those who 
remembered all of the images. 

2. Relevance of Results to Previous Research 

Snodgrass and Corwin (1988) assessed recognition memory tests that 
were used to examine memory deficits. The study found that as similar features 
between test items in the RMT increased, subjective familiarity with test items 
also increased. This phenomenon caused misidentification by having the wrong 
memory representations activated. In the present study, participants reported 
seeing images on the RMT before, but confused them with other similar items. 
This finding highlights the importance of proper encoding at stimuli perception 
(Tulving & Thompson, 1971). Participants who did not encode properly during 
the RMT had difficulties on the ROC-V thermal post-test. Using retrieval cues 


39 



only works when proper encoding occurs at the beginning of the memory trace 
process (Tulving & Thompson, 1973). 

Recognition memory produces higher performance results than recall 
since cues are available to drive recognition (Wisher et al., 1999). While vehicle 
cues were available for participants to analyze, additional cues that were not 
representative of the TVI task were present in the form of multiple-choice 
answers. This indicates that even with the presence of additional cues that aided 
in identifying a vehicle, recognition memory is still a significant predictor of 
performance on the ROC-V thermal post-test. 

3. Explanation of Significance 

Results from the present effort follow previous research, which determined 
that recognition studies return levels of accuracy above 50% (Brainerd et al., 
1990). The findings from the present study suggest that high levels of 
performance on recognition memory predict performance on a TVI task. 
However, as previously stated, the ROC-V task does not accurately represent 
the warfighter’s battlefield task. Since results from the present study indicate that 
recognition memory is a significant predictor of performance on a TVI task, 
providing high fidelity training to more accurately gauge true TVI performance is 
necessary to give commanders a better understanding of the skill levels of their 
soldiers. 

Assessing recognition memory abilities is an important aspect of 
developing a TVI training plan. Identifying individuals with lower recognition 
memory levels can help allocate training resources appropriately. Warfighters 
who demonstrate lower performance on a RMT may need additional training to 
improve their recognition memory. Furthermore, warfighters with low RMT 
performance may need more frequent refresher training for other recognition 
memory tasks in their occupational field. 
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D. TIME TO COMPLETE TRAINING AND PREVIOUS TVI TRAINING 

Post-hoc analysis of covariates revealed two additional significant 
findings. Participants with previous TVI training did significantly better on the 
thermal post-test. Additionally, the longer it took participants to complete the 
training modules, the worse their performance was on the post-tests. 

1. Previous TVI Training Resuits Review 

A logistic regression model was fitted to analyze the effect of previous TVI 
training on TVI task performance. Results from the regression model predicted 
performance on the post-tests. Eleven participants reported having previous TVI 
training in some capacity throughout their careers. Training last occurred 
between 2 and 11 years prior to participation in the present study. These results 
suggest that providing some level of TVI training during a warfighter’s career can 
have a beneficial impact on future TVI performance. ROC-V’s availability on 
mobile platforms can potentially provide this sort of exposure to occupational 
specialties that do not focus on TVI as a part of their training regimen. 
Heuckeroth et al. (1988) recommended that retraining efforts be started after 
nine weeks of initial training. This finding suggests that warfighters can leverage 
ROC-V’s availability on mobile platforms to train on TVI to not only reach high 
levels of proficiency, but also maintain them. 

2. Time to Complete Training Results Review 

A logistic regression model was fitted to analyze the relationship between 

time to complete the training module and performance on the post-test. The 

results indicated that the longer it took participants to complete the training 

module, the lower their scores were on the thermal post-test. This finding 

contradicts Heuckeroth et al.’s (1988) assertion that while different Soldiers reach 

different levels of performance initially, TVI performance decays at the same rate 

for all. Participants who took longer to complete the study experienced greater 

performance decrement. This finding also indicates that warfighters who take 

longer on the self-guided training modules may have memory deficits that 
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preclude retention over long periods and result in lower levels of performance on 
a TVI task. Commanders can shape their training programs to identify threshold 
values for training time that their units must achieve. Warfighters who fall outside 
of those values can be identified and provided more frequent fresher training to 
maintain their levels of performance. This may also serve as an exclusionary 
measure in personnel selection for military occupational specialties that are 
highly dependent on TVI task skills. 
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VI. CONCLUSION AND RECOMMENDATIONS 


A. CONCLUSIONS 

During this study, we found that performance on a TVI task did not decay 
exponentially over a 16-day period. While time since training did not serve as a 
significant predictor of performance, time to complete training and previous TVI 
training did. Decay over time not being significant was contrary to expectations 
based on previous studies. Performance on the thermal image test was 
significantly worse than the visible image test across all participants. Post-hoc 
analysis determined that recognition memory ability on the TVI task was a 
significant predictor of performance. Average sleep was not a significant 
predictor of performance. 

The design of the experiment controlled for differences in individual 
recognition memory abilities in order to test the impact of passage of time in 
forgetting. While there was not enough difference between participants to assign 
them to groups with different recognition memory abilities, results of the 
recognition memory test was a significant predictor of performance on the TVI 
task. 

Time since training was not a significant predictor of performance but 
previous TVI training was. While many participants did not have previous formal 
TVI training, they reported that they have seen many vehicles as a part of their 
daily activities while working on military bases. This may have contributed to the 
higher test scores across all participants. Previous training occurred at least two 
years prior to the present study. This implies that memory of TVI training may 
last longer than expected. This finding emphasizes the point that at least some 
level of TVI training can potentially mitigate fratricide. Furthermore, continued 
training over a warfighter’s career can reinforce TVI skill level. 

We did find significant results in training duration with respect to 
performance on the TVI post-tests. As time became longer for participants to 
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complete the TVI training, their post-test scores decreased. While this was a 
significant finding, it may not necessarily mean that training programs need to be 
short in duration. Initial entry Warfighters with no previous knowledge of military 
vehicles may need longer times to achieve minimum proficiency. However, if 
Warfighters need longer periods of self-guided training modules on ROC-V, it 
may indicate they have other problems with memory. This could mean that 
commanders will have to shape training programs to ensure total unit readiness. 

The experimental design allowed for analysis of memory between 
recognition of thermal and visible images. The significant difference in 
performance between thermal and visible images is telling. Lower performance 
on the thermal image set emphasizes the point that more attention needs to be 
placed on these images in training plans. As the military continues to rely on 
infrared technology to conduct combat operations, recognition of thermal images 
continues to be low. The different cues that thermal imagery present are more 
difficult to recognize than visible imagery and necessitate increased attention to 
these differences. 

B. RECOMMENDATIONS FOR FOLLOW-ON RESEARCH 

Future research can expand our understanding of the effect of memory on 
TVI task performance. Increasing the amount of time between training and 
testing may reveal a significant impact of time on memory decay for a recognition 
task. While the training provided by ROC-V is robust and effective, the multiple- 
choice post-test is not representative of a real-world task. Tank gunners are not 
presented with options for the type of target in a tank gunner’s sights and then 
asked to select the correct response. Future research using ROC-V should 
analyze “shoot/don’t shoot” scenarios where participants have to decide whether 
the vehicle is a threat or not. Another study should investigate participants’ 
ability to recall names of vehicles when presented only with an image of the 
vehicles. This would allow for more detailed analysis on the effect of memory 
decay on the TVI task. Future research must not only address the effects of 
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memory decay over time, but also on effective ways to train and test thermal 
images to reach higher levels of proficiency. Follow-on studies should analyze 
retraining intervals in order to determine effective refresher training schedules. 
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APPENDIX A. RECOGNITION MEMORY TEST 


The recognition memory test used in this study was derived using 
Warrington’s (1984) framework. 


You are about to take part in a test that measures 
your ability to remember images. You will be shown 
a sequence of images to remember. Immediately 
afterward, you will see a screen with 36 images 
randomly arrayed. Write down the names of the 
images you recognize. When you have written down 
all of the images you can remember, turn your 
answer sheet upside down. 



47 







48 







49 





50 





51 





52 








53 





54 




APPENDIX B. DEMOGRAPHIC QUESTIONNAIRE 


1. What is your age? 

2. What is your branch of service? 

3. What is your military occupational specialty? 

4. What is your rank? 

5. How many years in service do you have? 

6. What is your visual acuity? 

7. Have you ever received fighting vehicle identification training? 

7a. If yes, when was the last time? 
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APPENDIX C. PARTICIPANT SLEEP LOG 


Participant Number 


Sleep and Activity Log 


Example: 

The example below shows how to appropriately fill out this sleep log Boxes containing 'N' indicate periods when a nap occurred, while boxes 
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Log: 

Please mark in the log below all sleep and nap periods for the 72 hours prior to your second test date. Estimate sleep and wake times to the nearest 
30-minute interval. See the example above for clarification. 
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Figure 10. Participant Sleep Log 
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